Picture for Yuhui Wang

Yuhui Wang

DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training

Add code
Feb 05, 2026
Viaarxiv icon

A Unified Framework for Rethinking Policy Divergence Measures in GRPO

Add code
Feb 05, 2026
Viaarxiv icon

RASA: Routing-Aware Safety Alignment for Mixture-of-Experts Models

Add code
Feb 04, 2026
Viaarxiv icon

OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment

Add code
Jan 04, 2026
Viaarxiv icon

A data-physics hybrid generative model for patient-specific post-stroke motor rehabilitation using wearable sensor data

Add code
Dec 16, 2025
Viaarxiv icon

Synthetic Voices, Real Threats: Evaluating Large Text-to-Speech Models in Generating Harmful Audio

Add code
Nov 14, 2025
Viaarxiv icon

LLMEval-3: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models

Add code
Aug 07, 2025
Viaarxiv icon

LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation

Add code
Jun 04, 2025
Figure 1 for LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation
Figure 2 for LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation
Figure 3 for LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation
Figure 4 for LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation
Viaarxiv icon

Self-Destructive Language Model

Add code
May 18, 2025
Figure 1 for Self-Destructive Language Model
Figure 2 for Self-Destructive Language Model
Figure 3 for Self-Destructive Language Model
Figure 4 for Self-Destructive Language Model
Viaarxiv icon

AutoRAN: Weak-to-Strong Jailbreaking of Large Reasoning Models

Add code
May 16, 2025
Viaarxiv icon